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Abstract 

In this paper we propose the use of (^-divergences as test statistics to ver- 
ify simple hypotheses about a one-dimensional parametric diffusion process 
dX t = b(Xt,9)dt + a(X t ,9)dWt, from discrete observations {X ti ,i = 
0, . . . , n} with U = iA n , i = 0, 1, . . . , n, under the asymptotic scheme 
A n — > 0, nA n — > oo and nA^ — ► 0. The class of (^-divergences is wide and 
includes several special members like Kullback-Leibler, Renyi, power and 
a-divergences. We derive the asymptotic distribution of the test statistics 
based on (^-divergences. The limiting law takes different forms depending 
on the regularity of <j). These convergence differ from the classical results for 
independent and identically distributed random variables. Numerical analy- 
sis is used to show the small sample properties of the test statistics in terms 
of estimated level and power of the test. 

keywords: diffusion processes, empirical level, hypotheses testing, ^-divergences, 
a-divergences 
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1 Introduction 



We consider the problem of parametric testing using ^-divergences. Let X be 
a r.v. and f(X,9) and g(X,9), 9 E 8 two families of probability densities 
on the same measurable space. The ^-divergences are defined as D^f.g) = 
E e <p (f(X)/g(X)), where Eg is the expected value with respect to P e , the true 
law of the observations. Because we focus the attention on the use of divergences 
for hypotheses testing, we will use a simplified notation: let 9 and 9 two points 
in the interior of 6 and define the divergence as 

In equation (11.11) the density {p(X, 9), 9 E 9} is a same family of probability 
densities and 0(-) is a function with the minimal property that 0(1) = 0. Examples 
of divergences of the form D a (9, 9 ) = D^ a (9, 9 ) are the a-divergences, defined 
by means of the following function 

4(1 -x 1 ^) 

<f>a{x) = -1 < a< 1 

1 — or 

Note that D a (9 , 9) = D^ a (9, 9 ). The class of a-divergences has been widely 
studied in statistics (see, e.g., Csiszar, 1967 and Amari, 1985) and it is a family of 
divergences which includes several members of particular interest. For example, 
in the limit as a — > —1, D_i(9, 9 ) reduces to the well-known Kullback-Leibler 
measure 

while as a — >• 0, the Hellinger distance (see, e.g., Beran, 1977, Simpson, 1989) 
emerges 

D (9, 9 Q ) = \e (y^Xj) - VRxJ 

As noticed in Chandra and Taniguchi (2006), the ct-divergence is also equivalent 
to the Renyi's divergence (Renyi, 1961) defined, for a E (0, 1), as 

R a (9,9 ) = -^— log E 0O (4^ 



a b U0 \p(X,9 ) 

from which is easy to see that in the limit as a —> 1, R a reduces to the Kullback- 
Leibler divergence. The transformation ip(R a ) = (exp{(a — l)R a — 1}/(1 — a) 
returns the power-divergence studied in Cressie and Read (1984). Liese and Va- 
jda (1987) provide extensive study of a modified version of R a and Morales et al. 
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(1997) consider divergences with convex </>(■) for independent and identically dis- 
tributed (i.i.d) observations; for example the power-divergences D < f >x (9, 9 ) with 

, . . x x - X(x - 1) - 1 ,„ „ N 

MX) = A(A-l) 

and A e R - {0, 1}. 

In this paper we focus our attention on the (^-divergences D^iO, 9 ), defined 
as in (11.11) . for one-dimensional diffusion process {X t , t E [0, T]}, solution of the 
following stochastic differential equation 

dX t = b(a,X t )dt + a((3,X t )dW t , X = x , (1.3) 

where W t is a Brownian motion, 9 = (a, (3) E Q a x Qp = 0, where a and Qp 
are respectively compact convex subset of IR P and M. q . We assume that the process 
X t is ergodic for every 9 with invariant law fj,g. Furthermore X t is observed at 
discrete times = iA n ,i = 0, 1,2, ...,n, where A n is the length of the steps. 
We indicate the observations with X n = {Xt.}o^i<„. The asymptotic is A n 
0, nA n — > oo and nA^ — >• as n — > oo. 

We study the properties of the estimated 0-divergence D^^X,,),^), for 
discretely observed diffusion processes, defined as 



/n(X n , ^ n (X n )) 
/n(X n , ^o) 



where /«(•,•) is the approximated likelihood proposed by Dacunha-Castelle and 
Florens-Zmirou (1986) and n (X n ) is any consistent, asymptotically normal and 
efficient estimator of 9. We prove that, for </>(•) functions which satisfying three 
different regularity conditions, the statistic converge weakly to three different 
functions of the xl+ q random variable. This result differs from the case of i.i.d. 
setting. 

Up to our knowledge the only result concerning the use of divergences for 
discretely observed diffusion process is due to Rivas et at (2005) where they 
consider the model of Brownian motion with drift dX t = adt + bdW t where 
a and b are two scalars. In that case, the exact likelihood of the observations 
is available in explicit form and is the gaussian law. Conversely, in the general 
setup of this paper, the likelihood of the process in (11.31) is known only for three 
particular stochastic differential equations, namely the Ornstein-Uhlembeck dif- 
fusion, the geometric Brownian motion and the Cox-Ingersoll-Ross model. In all 
other cases, the likelihood has to be approximated. We choose the approxima- 
tion due to Dacunha-Castelle and Florens-Zmirou (1986) and, to derive a proper 
estimator, we use the local gaussian approximation proposed by Yoshida (1992) 
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although our result holds for any consistent and asymptotically Gaussian estima- 
tor. This approach has been suggested by the work on Akaike Information Criteria 
by Uchida and Yoshida (2005). 

For continuous time observations from diffusion processes, Vajda (1990) con- 
sidered the model dX(t) = -b(t)X t dt + a(t)dW t ; Kuchler and S0rensen (1997) 
and Morales et al. (2004) contain several results on the likelihood ratio test statis- 
tics and Renyi statistics for exponential family of diffusions. Explicit deriva- 
tions of the Renyi information on the invariant law of ergodic diffusion processes 
have been presented in De Gregorio and Iacus (2007). For small diffusion pro- 
cesses, with continuous time observations, information criteria have been derived 
in Uchida and Yoshida (2004) using Malliavin calculus. 

The problem of testing statistical hypotheses from general diffusion processes 
is still a developing stream of research. Kutoyants (2004) and Dachian and Ku- 
toyants (2008) consider the problem of testing statistical hypotheses for ergodic 
diffusion models in continuous time; Kutoyants (1984) and Iacus and Kutoyants 
(2001) consider parametric and semiparametric hypotheses testing for small dif- 
fusion processes; Negri and Nishiyama (2007a, b) propose a non parametric test 
based on score marked empirical process for both continuous and discrete time 
observation from small diffusion processes further extended to the ergodic case in 
Masuda et al. (2008). Lee and Wee (2008) considered the parametric version of 
the same test statistics for a simplified model. 

Ait-Sahalia (1996, 2008), Giet and Lubrano (2008) and Chen et al. (2008) 
proposed tests based on the several distances between parametric and nonpara- 
metric estimation of the invariant density of discretely observed ergodic diffusion 
processes. The present paper complements the above references. 

The paper is organized as follows. Section [2] introduces notation and regular- 
ity assumptions. Section [3] states the main result. Section |4] contains numerical 
experiments to test the small sample performance of the proposed test statistics in 
terms of empirical level and empirical power under some alternatives. The proofs 
are contained in Section [5l 

2 Assumptions on diffusion model 

We consider the family of one-dimensional diffusion processes {X t , t E [0, T]}, 
solution to 

dX t = b(a, X t )dt + a(f3, X t )dW t , X = x , (2.1) 

where W t is a Brownian motion. Let 9 = (a, (3) G Q a x = G, where 
Q a and 6^ are respectively compact convex subset of R p and R c/ . Furthermore 
we assume that the drift function b : RxO Q — > R and the diffusion coefficient 
a : R x Qp — > R are known apart from the parameters a and (3. We assume that 
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the process X t is ergodic for every 9 with invariant law fig. The process X t is 
observed at discrete times t t = iA n , i — 0, 1, 2, n, where A„ is the length of 
the steps. We indicate the observations with X n = {X ti } ^n- The asymptotic is 
A n — > 0, nA n — > oo and nA^ — > as n — > oo. 

In the definition of the ^-divergence (11.11) the likelihood of the process is need, 
but as noted in the Introduction, this is usually not know. There are several ways to 
approximate the likelihood of a discretely observed diffusion process (for a review 
see, e.g., Chap. 3, Iacus, 2008). In this paper, we use the approximation proposed 
by Dacunha-Castelle and Florens-Zmirou (1986) although our result hold true 
(with some adaptations of the proofs) for other approximations, like, e.g. the 
one based on Hermite polynomial expansion by Ai't-Sahalia (2002). To write it 
in explicit way, we use the same setup as in Uchida and Yoshida (2005). We 
introduce the following functions 



The following set of assumptions ensure the good behaviour of the approximated 
likelihood and the existence of a weak solution of (12.11) 

Assumption 2.1. [Regularity on the process] 

i) There exists a constant C such that 



ii) mfp >x a 2 ([3,x) > 0. 

Hi) The process X is ergodic for every 9 with invariant probability measure fig. 
All polynomial moments of fig are finite. 

iv) For all m > and for all 0, sup 4 E\X t \ m < oo. 

v) For every 9, the coefficients b(a, x) and cr({3, x) are twice differentiate with 
respect to x and the derivatives are polynomial growth in x, uniformly in 9. 

vi) The coefficients b(a, x) and a([3, x) and all their partial derivatives respect 
to x up to order 2 are three times differentiable respect to 9 for all x in 
the state space. All derivatives respect to 9 are polynomial growth in x, 
uniformly in 9. 

Assumption 2.2. [Regularity for the approximation] 




b(a, x) 
cr(l3,x) 



a'(P,x) 
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B(x, 9) = B{s~\(3, x),9), h(x, 9) = B 2 (x, 9) + B'(x, 9) 



\b(a ,x) - b(a ,y) \ + \<r(f%,x) - a(f3 ,y)\ < C\x - y 
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i) h(x, 9) = 0(|x| 2 ) as x — > oo. 

ii) ini x h(x,9) > — oo for all 9. 
Hi) sup 9 sup^ \h 3 (x,9)\ < M < oo. 

iv) There exists 7 > such that for every 9 and j = 1,2, |.EP(x, 0)| = 
0(|£(x,0)| 7 ) as \x\ -> 00. 

Assumption 2.3. [Identifiability] The coefficients b(a, x) = b(a , x) and cr(/3, 2) = 
cr(/?o, x)for fj,g Q a.s. all x then a = a and (3 = (3 . 

Under As sumptions 12.11 and 12 .21 Dacunha-Castelle and Florens-Zmirou (1986) 
introduced the following approximation of transition density / of the process X 
from y to x at lag t 

f(x, y, t, 9) = -j==}— — exp j- - + H(x, y, 9) + tg(x, y, t 

(2.2) 

and its logarithm 

1 S 2 (r v 8) 

l(x,y,t,9) = - - log(27rt) - log a(y, (3) - 1 £ P) + H(x, y, 9) + tg(x, y, 9) 



where 



J X 



y du 



a{u,f3) 



9{x,y, 



H{x,y,9)= < - - \du 

= - l - |c(x, 9) + C(y, 61) + ±B(x, 9)B(y, 9) } 

C(x,e) = ^ 2 (x,e) + iB / (x,e)a(x,/3) 

The approximated likelihood and log-likelihood functions of the observations X„ 
become respectively 

n 

f n (X n ,9) = l[f(A n ,X u _ l ,X ti ,9) 

i=i 

n 



i=l 
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3 Construction of the test statistics and results 

Consider the divergence defined in (11.11) and let </>(■) be such that 0(1) = and, 
when they exist, define = 4>'(l) and = 0"(1). We consider three different 
setup 

Assumption 3.1. ^ is a finite constant depending only on <p and independent 
of9; 

Assumption 3.2. = and ^ is a finite constant depending only on <j) 
and independent of 9; 

Assumption 3.3. ^ and ^ are finite constants depending only on <f> 
and independent of 9; 

Remark 3.1. The above Assumptions are not so strong. In fact, for example 
the a-divergences D^ a (9,9 ) satisfy the Assumptions 1X71 and \33\ while for the 
power-divergences D^ x (9, 9 ) it's easy to verify that = 0'(1) = 0. 

Clearly, the quantity D^O^Oq) measures the discrepancy between 9 and the 
true value of the parameter 9 and is an ideal candidate to construct a test statistics. 
Let 9 n (K n ) be any consistent estimator of 9 and such that 

r- 1 /2(0 n (X n ) - 9 ) S N(0, X{9 Q y 1 ) (3.1) 



where Z(6* ) is the positive definite and invertible Fisher information matrix at #0 
equal to 

(Zb 3 (0o))k,j=i,..., P 

(Z^(0o)k;=i,...,« 

where 



1(9 



fc - f 1 db(a ,x) db(a ,x) 

J a 2 {(3 ,x) da k daj 

q-kj(n \ r> f 1 da(Po,x)da(Po,x) 

J (T 2 ((3 Q ,x) df3 k dfy 

We indicate with T the (p + q) x (p + q) matrix 

nA„ V 

-L 

and I p is the p x p identity matrix. Using the approximated likelihood / n (X n , 9) 
and / n (X n , 8q), the 0-divergence in (11.11) becomes 

7 



To construct a test statistics we replace 9 by the estimator 9 n (X. n ) and, having 
only one single observation of X n , i.e. only one observed trajectory, we estimate 
(TO) with 

DAW. flo) = * ( /w( /^(x!j ( ^) W)) ) (33) 

Please notice that, conversely to the i.i.d. case, there is no integral in the definition 
of (13.31) . We will discuss this point after the presentation of the Theorem 13.11 
The proposed test for testing H : 9 = 9 versus Hi : 9 ^ 9 is realized as 

%(0 n (X n ), O ) = versus E>^(0 n (X n ), O ) ± 0. 

Theorem 3.1. Under H : 9 = 9 , Assumptions 12. 7 IT231 convergence (13.11) , we 
/zave f/W 

z) if function <p{) satisfies Assumption ^. 1\ then 

D (0 re (X re ),0 o )^C^ +9 (3-4) 
n'J if function 4>{-) satisfies Assumption ^ '.21 then 

B (/> (9 n (X n ),9 Q )^ I ^Z p+q (3.5) 

w/zere v/Zp+g = Xp +g . 
z'z'z) if function (/)(•) satisfies Assumption ^ .3\ then 



>(0 n (X n ), O ) ^ ^(C^ + (O, + /^)^ P+g ) (3.6) 



Remark 3.2. /z^s c/ear that for = Ofrom (|3.61) we immediately reobtain the 
convergence result (13.51) . 

Remark 3.3. //" we consider the limits as a — > —lfor 4> a (x) of the a-divergences, 
i.e. we consider the Kullback-Leibler divergence, we have 

4>(x) = lim 4> a (x) = — log(x) 

Ol — * — 1 

for which = — 1 and = 1. In that case, (13.61) reduces to the standard result 
for the likelihood ratio test statistics. 

The convergence in Theorem 13. II may appear somewhat strange if one thinks 
about the usual results on ^-divergences for i.i.d. observations. The main differ- 
ence in diffusion models, is that our estimate of the divergence has not the usual 
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form of an expected value, i.e. it estimates the expected value with one observa- 
tion only. This is why, in the i.i.d case, the first term in the Taylor expansion of 

vanishes being the expected value of the score function, while in our case it 
remains only the score function which, as usual, converges to a Gaussian random 
variable. For the same reason, in the second term of the Taylor expansion, in the 
i.i.d. case appears the expected value of the second order derivative which con- 
verges to the Fisher information and, in our case, we have not the expected value, 
hence the convergence to the square of the x 2 emerges. 

If one wants to emulate the standard results for the i.i.d. case, it is still possible 
to work on the invariant density of the diffusion process. In that case, the cj>- 
divergence takes the usual form of the i.i.d. case because the invariant density 
have the explicit form. Indeed, let 

s(x J 9)=exp{-2 f m(x,6)- 



a 2 (y,9) a ) ' v ' ' a 2 (x,9)s(x,9) 

be the scale and speed functions of the diffusion, with x some value in the state 
space of the diffusion process. Let M = J m(x, 9)dx, then ir(x, 9) = m(x, 9) /M 
is the invariant density of the diffusion process. In this case, it is possible to define 
the 0-divergence as 

D*(0 n ,0 Q ) = j '</> ^^j) <x,9 )dx 
and the standard results follows. 

Remark 3.4. In our application, to derive and estimator, we consider further the 
local gaussian approximation of the same transition density (see, Yoshida, 1992) 

n 

g n (X n ,9) = Y,9n(A n ,X u _ 1 ,X u ,9) (3.7) 

i=l 

where 

1 , /„ x , /„ % \y — x — tb(a,x)] 2 
g(t, x, y, 9) = -- log(27rt) - loga(A x) 



2 &v ' *> v ' 2ta 2 (f3,x) 

The approximate maximum likelihood estimator 9 n (X. n ) based on (13.71) is then 
defined as 

n (X n ) = argsup5f n (X n ,6») (3.8) 
e 

Under the condition nA^ — > (see Theorem 1 in Kessler, 1997) the estima- 
tor 9 n (K n ) in (13.81) satisfies (13.11) . Hence, the result of Theorem \3. 1 1 applies for 
9 n (X. n ) = 6 n (K n ). 
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Remark 3.5. In Theorem I3.il there is no need to impose = and = 1 as, 
e.g. in Morales et al. (1997). Of course, in our case the constants and 
enter in the asymptotic distribution of the test statistics. The convergence result 
is also interesting because, contrary to the i.i.d case, the rate of convergence of 
the estimators of 9 for the drift and diffusion coefficients are different and are 
respectively equal to -\/nA n and \fn. 

Remark 3.6. As remarked in Uchida and Yoshida (2001 ), it is always better to de- 
rive approximate ML estimators and the test statistics on different approximations 
of the true likelihood to avoid circularities. 

4 Numerical analysis 

Although asymptotic properties have been obtained, what really matters in appli- 
cation is the behaviour of the test statistics under fine sample setup. We study the 
empirical performance of the test for small samples in terms of level of the test 
and power under some alternatives. In the analysis we consider the estimator (13.81) 
and the following quantities 

• estimated a-divergences 

D Q (0 n (x n )A) = <A 

with cj) a (x) = 4(1 - x^)/(l - a 2 ), with C a = ^ and = 1. We 
consider a E {-0.99, -0.90, -0.75, -0.50, -0.25, -0.10}; 

• estimated power-divergences 

o A (<? n (x„) A) = 0a 

with <f> x (x) = (x A+1 - x-X{x - 1))/(A(A + 1)), with C x = 0, K x = 1. 
We consider A e {-0.99, -1.20, -1.50, -1.75, -2.00, -2.50}; 

• likelihood ratio statistic 

B log (9 n (X n ),9 ) = - log ^fj^f ^ 



/n(X n , # n (X n )) 
/n(X n , 6> ) 



/n(X n , 9 n {X. n )) 
/n(X n , 9q) 
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For D Q and ©a* the threshold of the rejection region of the test are calculated using 
formula (13.61) as the empirical quantiles of (13.61) of 100000 simulations of the ran- 
dom variable xl+ q - For ©log is again used formula (13.61) but exact quantiles of the 
random variable xt +q are used. Because the interest is in testing = against 

7^ 0, whenever / n (X n , n (X n )) > _f n (X n , O ) we exchange the numerator and 
the denominator to avoid negative signs in the test statistics. Usually, this is not 
going to happen if <p is convex and 0'(1) = (see, e.g. Morales et ah, 1997). 

We evaluate the empirical level of the test calculated as the number of times 
the test rejects the null hypothesis under the true model, i.e. 



where 1 A is the indicator function of set A, M — 10000 is the number of simu- 
lations and c a is the (1 — a)% quantile of the proper distribution. Similarly we 
calculate the power of the test under alternative models as 



1 M 



A/[ / J _ l«V> c c«} 
1=1 

In our experiments we consider the two families of stochastic processes borrowed 
from finance 

• the Vasicek (VAS) model 

dX t = n(a - X t )dt + aXtdWt 

where, in finance, a is interpreted as volatility, a is the long-run equilib- 
rium value of the process and n is the speed of reversion. Let (n , a , a" 2 ) = 
(0.85837, 0.089102, 0.0021854), we consider three different sets of hypothe- 
ses for the parameters 

model 6 = (k, a, a 2 ) 



VAS (k , a , a 2 ) 
VASi (4-K ,a ,4-o-g) 
VAS 2 (iKo,ao,i-ffo) 

The interesting facts are that VAS , VASi and VAS 2 have all the same sta- 
tionary distributions N(a , a 2 /(2K )), a Gaussian transition density 

N (a + (x - « )e" Kt , - 
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and covariance function given by 



Gov(X s ,X t ) 




-«(s+i) 



2k(sA*)-1 



) 



and both show a strong dependency of the covariance as a function of k, 
which makes this model interesting in comparison with the i.i.d. setting; 

• the Cox-Ingersoll-Ross (CIR) model 



Let (kq, a , erg) = (0.89218, 0.09045, 0.032742), we consider different sets 
of hypotheses for the parameters 



This model has a transition density of x 2 -type, hence local gaussian approx- 
imation is less likely to hold for non negligible values of A„. 

The parameters of the above models, have been chosen according to Pritsker 
(1998) and Chen et al. (2008), in particular VASo corresponds to the model esti- 
mated by A'it-Sahalia (1996) for real interest rates data. 

We study the level and the power of the three family of test statistics for differ- 
ent values of A n e {0.1, 0.001} and n G {50, 100, 500}. For the same trajectory, 
hence we simulate 1000 observations and we extract only that last n observations. 
Disregarding the first part of the trajectory ensures that the process is in the sta- 
tionary state. 

The results of these simulations are reported in the Tables [TB We point out 
that in the Tables [2l|4l|7] and [9l in the column "model (a, n)" the a corresponds to 
the true level of the test used to calculate c a . The other en's in the first row of the 
tables correspond to the a in ^-divergences. 

Summary of the analysis for the Vasicek model It turns out that a-divergences 
are not very good in terms of estimated level of the test, but their power function 
behaves as expected. It also emerges that for A = —0.99, the power divergence 
cannot identify as wrong model VASi for small sample size n = 50 and A n = 
0.001 (Tabled row 2), although this is not the case for the power-divergences and 
the likelihood ratio test (Tables [2] and [Q row 2). 




model 9 = (k, a, a 2 ) 



CIR («o, CKo, Co) 

CIRi (5 • Kq, a , s ■ Oq) 
CIR 2 (| • Kq, Cv , \ ■ al) 
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In general power divergences for A in {—0.99, —1.20, —1.50, —1.75, —2.00} 
have always very small estimated level and high power under the selected alterna- 
tives. The a-divergences, do not behave very good and, the way they are defined, 
only approximate the likelihood ratio for a = —0.99. 

The power divergences are, on average, better than the likelihood ratio test in 
terms of both empirical level a and power (3 under the selected alternatives. 

Summary of the analysis for the CIR model The same average considerations 
apply to the case of CIR model. The difference is that, for small sample size, 
all test statistics have low power under the alternative CIRi while CIR 2 doesn't 
present particular problems. 



5 Proofs 

The following important Lemmas are useful to prove the Theorem l3.ll 

Lemma 5.1 (Kessler, 1997). Under the assumptions l2.ilT2"31 as nA\ — > the 
following hold true 

r l *Veg n (X n ,e ) ^ N(0,l(e )) (5.1) 

Lemma 5.2 (Uchida and Yoshida, 2005). Under the assumptions I2JH2.3I as 
nA^ — > the following hold true 

H V fl Z n (X n , 6 ) = H V^n(X n , ) + o p (l) (5.2) 

Lemma 5.3 (Uchida and Yoshida, 2005). Under the assumptions \2.1U.3\ as 
nA\ —> the following hold true 

Hv^(X„,^r^-Z(# ) (5.3) 

Proof of Theorem 1X71 We start by applying delta method. We denote the gradient 
vector by = [d/d9i], i = 1, . . . ,p + q and similarly the Hessian matrix by 

v 2 e = [d z /de i do j ],i,j = i,..., P + q. 

i) We can write that 

D (#„(x n )A) = iM^oA) + [v e U)4e ,e )} T (e n (x n )-e ) + o p (i) 

= [V e %(9o, 6 )} T (6 n (X n ) - 9 Q ) + o p (l) 
because ©^(^o, 9 ) = 0. Noting that for k — 1, ...,p + q 



_d_ 



fn(;9) 

fn(-, Oq) 



fn(-,e)\ df n 



f n (;0 o y \fn(-,e )j d6, 
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by Assumption ^ . 1 1 follows that 

VgO^o, 9 ) = C</> Vg/ n (X n , O)\ 0=0q = C<j>V gl n (X n , 6>o ) 
and therefore 



(X n ), # ) — C<f 



r5v e z„(x n ,e ) 



T 



r-2(^(x n )-0 o ) + Op (l) (5.4) 



From (15.41) by means of Lemma I5T21I5 . 1 1 and Slutsky's Theorem immediately fol- 
lows 



ii) Since for fc, j = 1, ...,p + q 



2 



dOkdOj 



follows that 



fn 



fn(-, do) 



1 fn(;8)\ df n {;6)df n {;6) 



1 jj ffn(;0) \ d 2 f n (;6) 



xT-^ 2 (e n (x n )-e ) + o p (i) 



[r- 1/2 te) - M T r 1/2 v*ux n A)[r 1/2 v e z n (x n ,0 o 
xr- 1 /2(^( Xn )-^ ) + Op (i) 



From (15.41) by means of Lemma I5T21I5 . 1 1 and Slutsky's Theorem immediately fol- 
lows 



p+q 



It's easy to verify that the density function of the r.v. Z p+q is equal to 



Jz p+q {z) - YjmyVZ e z > U 

ra) By previous considerations we have that 

D^ n (X n )A) = [V,D^oA)] T (#n(X n )-# ) 



(5.5) 



+±[r- 1 / 2 (9 n (x n ) - ^ )] T r 1 / 2 v^(^o^o)r 1/2 

xr-V 2 (6 n (X n )-6 ) + o p (l) (5.6) 
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where 



VpVfloA) 

= K^VeUXn, 9 )[V e lnOLn, )f + 1 . Vg/(X», O ) 

= (J^ + C ) V,/ n (X n , O ) [V e / n ,(X„, O )] T + C V^/ n (X n , O ) (5.7) 

Plugging in (15.61) the quantity (15.71) we derive, applying again Lemma l5Tll5.3[ the 
following result 

□ 

Conclusions 

It seems that, as in the i.i.d. case, also for discretely observed diffusion processes 
the ^-divergences may compete or improve the performance of the standard like- 
lihood ratio statistics. In particular, the power divergences are in general quite 
good in terms of estimated level and power of the test even for moderate sample 
sizes (e.g. n > 100 in our simulations). 

The package sde for the R statistical environment (R Development Core 
Team, 2008) and freely available at |http : //cran . R-Pro ject . org] con- 
tains the function sdeDiv which implements the 0-divergence test statistics. 
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model (n) 


a = 0.01 


a = 0.05 


VAS (50) 


0.01 


0.04 


VASi (50) 


1.00 


1.00 


VAS 2 (50) 


1.00 


1.00 




01 


04 


VASi (100) 


1.00 


1.00 


VAS 2 (100) 


1.00 


1.00 


VASn (500) 


0.01 


0.07 


VASi (500) 


1.00 


1.00 


VAS 2 (500) 


1.00 


1.00 


model (n) 


a = 0.01 


a = 0.05 


VAS (50) 


0.01 


0.04 


VASi (50) 


1.00 


1.00 


VAS 2 (50) 


1.00 


1.00 


VAS (100) 


0.01 


0.04 


VASi (100) 


1.00 


1.00 


VAS 2 (100) 


1.00 


1.00 


VAS (500) 


0.00 


0.02 


VASi (500) 


1.00 


1.00 


VAS 2 (500) 


1.00 


1.00 



Table 1: Numbers represent probability of rejection under the true generating 
model, with c a calculated under H . Therefore, the values are a under model "0" 
and (3 otherwise. Estimates calculated on 10000 experiments. Likelihood ratio, 
for A n = 0.001 (up) and A n = 0.1 (bottom). 
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model (a, n) 


a = -0.99 


a = -0.90 


a = -0.75 


a = -0.50 


a = -0.25 


a = -0.10 


VAS (0.01,50) 


0.01 


0.10 


0.39 


0.62 


0.73 


0.77 


VASi (0.01,50) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


VAS 2 (0.01,50) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


VAS (0.05, 50) 


0.04 


0.12 


0.39 


0.62 


0.73 


0.77 


\T\ C /A AC CA\ 

VASi (0.05, 50) 


1 AA 

1.00 


1 AA 

1.00 


1 AA 

1.00 


1 AA 

1.00 


1 AA 

1.00 


1 AA 

1.00 


VAS 2 (0.05, 50) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


VAS (0.01, 100) 


0.01 


0.10 


0.39 


0.63 


0.74 


0.78 


VASi (0.01, 100) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


VAS 2 (0.01, 100) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


VAS (0.05, 100) 


0.04 


0.11 


0.40 


0.63 


0.74 


0.78 


\T \ C /A AC 1 AA\ 

VASi (0.05, 100) 


1 AA 

1.00 


1 AA 

1.00 


1 AA 

1.00 


1 AA 

1.00 


1 AA 

1.00 


1 AA 

1.00 


VAS 2 (0.05, 100) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


VAS (0.01,500) 


0.02 


0.18 


0.61 


0.83 


0.90 


0.92 


VASi (0.01,500) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


VAS 2 (0.01,500) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


VAS (0.05, 500) 


0.07 


0.20 


0.61 


0.83 


0.90 


0.92 


VASi (0.05, 500) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


VAS 2 (0.05, 500) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 



Table 2: Numbers represent probability of rejection under the true generating model, with c a calculated under H . Therefore, 
the values are a under model "0" and (3 otherwise. Estimates calculated on 10000 experiments, a-divergences, for A n = 
0.001. 



model (a, n) A = -0.99 A = -1.20 A = -1.50 A = -1.75 A = -2.00 A = -2.50 



VAS (0.01,50) 


0.00 


0.00 


0.00 


0.01 


0.02 


0.04 


VASi (0.01,50) 


0.00 


0.99 


1.00 


1.00 


1.00 


1.00 


VAS 2 (0.01,50) 


0.40 


1.00 


1.00 


1.00 


1.00 


1.00 


VAS (0.05, 50) 


f\ f\f\ 

0.00 


f\ f\f\ 

0.00 


f\ f\f\ 

0.00 


0.01 


0.03 


0.06 


VASi (0.05, 50) 


0.67 


1.00 


1.00 


1.00 


1.00 


1.00 


VAS 2 (0.05, 50) 


0.99 


1.00 


1.00 


1.00 


1.00 


1.00 


VAS (0.01, 100) 


0.00 


0.00 


0.00 


0.01 


0.02 


0.04 


VASi (0.01, 100) 


0.23 


1.00 


1.00 


1.00 


1.00 


1.00 


VAS 2 (0.01, 100) 


0.88 


1.00 


1.00 


1.00 


1.00 


1.00 


"\ 7"A O ff\ f\d 1 f\f\\ 

VAS (0.05, 100) 


0.00 


r\ f\f\ 

0.00 


f\ f\f\ 

0.00 


0.01 


0.03 


0.06 


VASi (0.05, 100) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


VAS 2 (0.05, 100) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


VAS (0.01,500) 


0.00 


0.00 


0.00 


0.01 


0.03 


0.08 


VASi (0.01,500) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


VAS 2 (0.01,500) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


VAS (0.05, 500) 


0.00 


0.00 


0.01 


0.03 


0.06 


0.12 


VASi (0.05, 500) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


VAS 2 (0.05, 500) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 



Table 3: Numbers represent probability of rejection under the true generating model, with c a calculated under H . Therefore, 
the values are a under model "0" and (3 otherwise. Estimates calculated on 10000 experiments. Power-divergences for 
A n = 0.001 



model (a, n) 


a = -0.99 


a = -0.90 


a = -0.75 


a = -0.50 


a = -0.25 


a = -0.10 


VAS (0.01,50) 


0.01 


0.15 


0.55 


0.78 


0.86 


0.88 


VASi (0.01,50) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


VAS 2 (0.01,50) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


VAS (0.05, 50) 


0.05 


0.17 


0.55 


0.78 


0.86 


0.88 


\T\ C /A AC CA\ 

VASi (0.05, 50) 


1 AA 

1.00 


1 AA 

1.00 


1 AA 

1.00 


1 AA 

1.00 


1 AA 

1.00 


1 AA 

1.00 


VAS 2 (0.05, 50) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


VAS (0.01, 100) 


0.01 


0.13 


0.48 


0.71 


0.80 


0.83 


VASi (0.01, 100) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


VAS 2 (0.01, 100) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


VAS (0.05, 100) 


0.04 


0.15 


0.48 


0.71 


0.80 


0.83 


\T \ C /A AC 1 AA\ 

VASi (0.05, 100) 


1 AA 

1.00 


1 AA 

1.00 


1 AA 

1.00 


1 AA 

1.00 


1 AA 

1.00 


1 AA 

1.00 


VAS 2 (0.05, 100) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


VAS (0.01,500) 


0.00 


0.06 


0.25 


0.54 


0.69 


0.74 


VASi (0.01,500) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


VAS 2 (0.01,500) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


VAS (0.05, 500) 


0.02 


0.07 


0.25 


0.54 


0.69 


0.74 


VASi (0.05, 500) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


VAS 2 (0.05, 500) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 



Table 4: Numbers represent probability of rejection under the true generating model, with c a calculated under H . Therefore, 
the values are a under model "0" and (3 otherwise. Estimates calculated on 10000 experiments, a -divergences, for A n = 0.1 



model (a, n) A = -0.99 A = -1.20 A = -1.50 A = -1.75 A = -2.00 A = -2.50 



VAS (0.01,50) 


0.00 


0.00 


0.00 


0.01 


0.02 


0.05 


VASi (0.01,50) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


VAS 2 (0.01,50) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


VAS (0.05, 50) 


f\ f\f\ 

0.00 


f\ f\f\ 

0.00 


f\ f\f\ 

0.00 


0.02 


0.03 


0.09 


VASi (0.05, 50) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


VAS 2 (0.05, 50) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


VAS (0.01, 100) 


0.00 


0.00 


0.00 


0.00 


0.01 


0.05 


VASi (0.01, 100) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


VAS 2 (0.01, 100) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


"\ 7"A O ff\ f\d 1 f\f\\ 

VAS (0.05, 100) 


f\ f\f\ 

0.00 


r\ f\f\ 

0.00 


f\ f\f\ 

0.00 


0.01 


0.03 


0.08 


VASi (0.05, 100) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


VAS 2 (0.05, 100) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


VAS (0.01,500) 


0.00 


0.00 


0.00 


0.00 


0.01 


0.02 


VASi (0.01,500) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


VAS 2 (0.01,500) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


VAS (0.05, 500) 


0.00 


0.00 


0.00 


0.01 


0.01 


0.04 


VASi (0.05, 500) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


VAS 2 (0.05, 500) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 



Table 5: Numbers represent probability of rejection under the true generating model, with c a calculated under H . Therefore, 
the values are a under model "0" and (3 otherwise. Estimates calculated on 10000 experiments. Power-divergences for 
A n = 0.1 





\ n ) 


IX — w . w ± 


n — 0^ 

IX — W . W<J 


CIRn 


(50) 


0.02 


0.11 


CTR, 


("50^ 


59 


R4 






1 00 

1 . \J\J 


1 00 


CIRo 


(100) 


0.03 


0.11 


1 1\ ] 


1 \J\J ) 


96 


99 


CTRn 




1 00 

1 ,\J\J 


1 00 

1 .\J\J 


CIRo 


(500) 


0.02 


0.09 


CTR, 

1 1\ [ 


f500 N ) 


1 00 


1 00 


CTRo 


f500 N ) 


1 00 


1 00 


I11VJU.C 


(x\\ 
W 


(X — U.U 1 


n — n f)^ 

Ct — U.UJ 


CIRo 


(50) 


0.01 


0.04 


CIRi 


(50) 


0.78 


0.93 


CIR 2 


(50) 


1.00 


1.00 


CIRo 


(100) 


0.01 


0.04 


CIRx 


(100) 


0.99 


1.00 


CIR 2 


(100) 


1.00 


1.00 


CIRo 


(500) 


0.00 


0.02 


CIR! 


(500) 


1.00 


1.00 


CIR 2 


(500) 


1.00 


1.00 



Table 6: Numbers represent probability of rejection under the true model, with 
rejection region calculated under H . Likelihood ratio, for A n = 0.001 (up) and 
A n = 0.1 (bottom). 
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model (a, n) a = -0.99 a = -0.90 a = -0.75 a = -0.50 a = -0.25 a = -0.10 



CIR (0.01,50) 


0.03 


0.28 


0.69 


0.83 


0.89 


0.90 


CIRi (0.01,50) 


0.63 


0.95 


1.00 


1.00 


1.00 


1.00 


CIR 2 (0.01,50) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


CIR (0.05, 50) 


0.12 


0.31 


0.69 


0.83 


/\ OA 

0.89 


0.90 


CIRi (0.05, 50) 


0.86 


0.96 


1.00 


1.00 


1.00 


1.00 


CIR 2 (0.05, 50) 


1.00 


l.OO 


1.00 


1.00 


1.00 


1.00 


CIR (0.01, 100) 


0.03 


0.28 


0.69 


0.85 


0.89 


0.91 


CIRi (0.01, 100) 


0.97 


l.OO 


1.00 


1.00 


1.00 


1.00 


CIR 2 (0.01, 100) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


/~itt"» / f\ f\a 1 f\f\\ 

CIR (0.05, 100) 


0.12 


0.31 


0.69 


0.85 


0.89 


0.91 


CIRi (0.05, 100) 


0.99 


1.00 


1.00 


1.00 


1.00 


1.00 


CIR 2 (0.05, 100) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


CIR (0.01,500) 


0.03 


0.22 


0.59 


0.79 


0.86 


0.88 


CIRi (0.01,500) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


CIR 2 (0.01,500) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


CIR (0.05, 500) 


0.09 


0.24 


0.59 


0.79 


0.86 


0.89 


CIRi (0.05, 500) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


CIR 2 (0.05, 500) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 



Table 7: Numbers represent probability of rejection under the true model, with rejection region calculated under H . 
divergences, for A n = 0.001 



mode 


(a, n) 


A = -0.99 


A = -1.20 


A = -1.50 


A = -1.75 


A = -2.00 


A = -2.50 


CIR 


(0.01, 50) 


0.00 


0.00 


0.00 


0.02 


0.05 


0.13 


CIRi 


(0.01, 50) 


0.00 


0.01 


0.28 


0.55 


0.71 


0.87 


CIR 2 


(0.01, 50) 


0.00 


0.81 


1.00 


1.00 


1.00 


1.00 


CIR 


(0.05, 50) 


0.00 


0.00 


0.01 


0.04 


0.09 


0.19 


CIRi 


(0.05, 50) 


A AA 

0.00 


A AA 

0.09 


A A O 

0.48 


A "7 A 

0.70 


A O 1 

0.81 


A A^ 

0.92 


CIR 2 


(0.05, 50) 


0.00 


0.97 


1.00 


1.00 


1.00 


1.00 


CIR 


(0.01, 100) 


0.00 


0.00 


0.00 


0.02 


0.05 


0.14 


CIRi 


(0.01, 100) 


0.00 


0.21 


0.83 


0.95 


0.98 


0.99 


CIR 2 


(0.01, 100) 


0.00 


1.00 


1.00 


1.00 


1.00 


1.00 


CIR 


(0.05, 100) 


0.00 


0.00 


0.01 


0.05 


0.09 


0.20 


rim 

CIRi 


(0.05, 100) 


A AA 

0.00 


A CO 

0.53 


A AO 

0.93 


A AO 

0.98 


A AA 

0.99 


1 A A 

1.00 


CIR 2 


(0.05, 100) 


0.24 


1.00 


1.00 


1.00 


1.00 


1.00 


CIRo 


(0.01,500) 


0.00 


0.00 


0.00 


0.02 


0.04 


0.10 


CIRi 


(0.01, 500) 


0.00 


1.00 


1.00 


1.00 


1.00 


1.00 


CIR 2 


(0.01,500) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


CIRo 


(0.05, 500) 


0.00 


0.00 


0.01 


0.04 


0.07 


0.15 


CIRi 


(0.05, 500) 


0.95 


1.00 


1.00 


1.00 


1.00 


1.00 


CIR 2 


(0.05, 500) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 



Table 8: Numbers represent probability of rejection under the true model, with rejection region calculated under H . Power- 
divergences for A n = 0.001 



model (a, n) a = -0.99 a = -0.90 a = -0.75 a = -0.50 a = -0.25 a = -0.10 



CIR (0.01,50) 


0.01 


0.14 


0.54 


0.77 


0.85 


0.87 


CIRi (0.01,50) 


0.80 


0.98 


l.OO 


l.OO 


l.OO 


l.OO 


CIR 2 (0.01,50) 


1.00 


1.00 


l.OO 


l.OO 


l.OO 


l.OO 


CIR (0.05, 50) 


0.05 


A 1 / 

0.16 


0.54 


0.77 


0.85 


0.87 


CIRi (0.05, 50) 


0.94 


0.98 


l.OO 


l.OO 


l.OO 


l.OO 


CIR 2 (0.05, 50) 


1.00 


1.00 


l.OO 


l.OO 


l.OO 


l.OO 


CIR (0.01, 100) 


0.01 


0.13 


0.49 


0.7 1 


0.79 


0.82 


CIRi (0.01, 100) 


0.99 


1.00 


l.OO 


l.OO 


l.OO 


l.OO 


CIR 2 (0.01, 100) 


1.00 


1.00 


l.OO 


l.OO 


l.OO 


l.OO 


/~itt"» / f\ f\a 1 f\f\\ 

CIR (0.05, 100) 


0.04 


0.15 


0.49 


0.7 1 


0.79 


0.82 


CIRi (0.05, 100) 


1.00 


l.OO 


l.OO 


l.OO 


l.OO 


l.OO 


CIR 2 (0.05, 100) 


1.00 


l.OO 


l.OO 


l.OO 


l.OO 


l.OO 


CIR (0.01,500) 


0.00 


0.06 


0.28 


0.54 


0.69 


0.74 


CIRi (0.01,500) 


1.00 


l.OO 


l.OO 


l.OO 


l.OO 


l.OO 


CIR 2 (0.01,500) 


1.00 


l.OO 


l.OO 


l.OO 


l.OO 


l.OO 


CIR (0.05, 500) 


0.02 


0.08 


0.28 


0.54 


0.69 


0.74 


CIRi (0.05, 500) 


1.00 


l.OO 


l.OO 


l.OO 


l.OO 


l.OO 


CIR 2 (0.05, 500) 


1.00 


l.OO 


l.OO 


l.OO 


l.OO 


l.OO 



Table 9: Numbers represent probability of rejection under the true model, with rejection region calculated under H . 
divergences, for A n = 0.1 



mode 


(a, n) 


A = -0.99 


A = -1.20 


A = -1.50 


A = -1.75 


A = -2.00 


A = -2.50 


CIR 


(0.01, 50) 


0.00 


0.00 


0.00 


0.01 


0.02 


0.06 


CIRi 


(0.01, 50) 


0.00 


0.06 


0.52 


0.75 


0.86 


0.94 


CIR 2 


(0.01, 50) 


0.00 


0.99 


1.00 


1.00 


1.00 


1.00 


CIR 


(0.05, 50) 


0.00 


0.00 


0.00 


0.02 


0.04 


0.09 


CIRi 


(0.05, 50) 


A AA 

0.00 


AT? 

0.23 


A 1A 

0.70 


A O C 

0.85 


A AO 

0.92 


0.96 


CIR 2 


(0.05, 50) 


0.06 


1.00 


1.00 


1.00 


1.00 


1.00 


CIR 


(0.01, 100) 


0.00 


0.00 


0.00 


0.01 


0.02 


0.05 


CIRi 


(0.01, 100) 


0.00 


0.56 


0.96 


0.99 


1.00 


1.00 


CIR 2 


(0.01, 100) 


0.00 


1.00 


1.00 


1.00 


1.00 


1.00 


CIR 


(0.05, 100) 


0.00 


0.00 


0.00 


0.02 


0.03 


0.08 


rim 

CIRi 


(0.05, 100) 


A AA 

0.00 


A O O 

0.83 


A AA 

0.99 


1 A A 

1.00 


1 A A 

l.OO 


1 A A 

1.00 


CIR 2 


(0.05, 100) 


0.97 


1.00 


1.00 


1.00 


l.OO 


1.00 


CIRo 


(0.01,500) 


0.00 


0.00 


0.00 


0.00 


0.01 


0.02 


CIRi 


(0.01, 500) 


0.00 


1.00 


1.00 


1.00 


1.00 


1.00 


CIR 2 


(0.01,500) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


CIRo 


(0.05, 500) 


0.00 


0.00 


0.00 


0.01 


0.02 


0.04 


CIRi 


(0.05, 500) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


CIR 2 


(0.05, 500) 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 



Table 10: Numbers represent probability of rejection under the true model, with rejection region calculated under H . 
Power-divergences for A n = 0.1 



